Input file Fbh2786c.seq; Output File 2786.trans 
Sequence length 2459 

10 20 30 40 50 60 M A S G 4 

GCGGCCGCGTCGACCTCCCCTCGGGTTCGCGGCCCGGCCGGTGAGCAACGGCTCTGCGGCC ATG GCG AGC GGC 12 73 

EHSPGSGAARRPLHSAQAVD 24 

GAG CAT TCC CCC GGC AGC GGC GCG GCC CGG CGG CCG CTG CAC TCC GCG CAG GCT GTG GAC 72 133 

VASASNFRAFELLHLHLDLR 44 

GTG GCC TCG GCC TCC AAC TTC CGG GCC TTT GAG CTG CTG CAC TTG CAC CTG GAC CTG CGG 132 193 

AEFGPPGPGAGSRGLSGTAV 64 
GCT GAG TTC GGG CCT CCA GGG CCC GGC GCA GGG AGC CGG GGG CTG AGC GGC ACC GCG GTC 192 253 

LDLRCLEPEGAAELRLDSHP 84 
CTG GAC CTG CGC TGC CTG GAG CCC GAG GGC GCC GCC GAG CTG CGG CTG GAC TCG CAC CCG 252 313 

CLEVTAAALRRERPGSEEPP 104 
TGC CTG GAG GTG ACG GCG GCG GCG CTG CGG CGG GAG CGG CCC GGC TCG GAG GAG CCG CCT 312 373 

AEPVSFYTQPFSHYGQALCV 124 
GCG GAG CCC GTG AGC TTC TAC ACG CAG CCC TTC TCG CAC TAT GGC CAG GCC CTG TGC GTG 372 433 

SFPQPCRAAERLQVLLTYRV 144 
TCC TTC CCG CAG CCC TGC CGC GCC GCC GAG CGC CTC CAG GTG CTG CTC ACC TAC CGC GTC 432 493 

GEGPGVCWLAPEQTAGKKKP 164 
GGG GAG GGA CCC GGG GTT TGC TGG TTG GCT CCC GAG CAG ACA GCA GGA AAG AAG AAG CCC 492 553 

FVYTQGQAVLNRAFFPCFDT 184 
TTC GTG TAC ACC CAG GGC CAG GCT GTC CTA AAC CGG GCC TTC TTC CCT TGC TTC GAC ACG 552 613 

PAVKYKYSALIEVPDGFTAV 204 
CCT GCT GTT AAA TAC AAG TAT TCA GCT CTT ATT GAG GTC CCA GAT GGC TTC ACA GCT GTG 612 673 

MSASTWEKRGPNKFFFQMCQ 224 
ATG AGT GCT AGC ACC TGG GAG AAG AGA GGT CCA AAT AAG TTC TTC TTC CAG ATG TGT CAG 672 733 

PIPSYLIALAIGDLVSAEVG 244 
CCC ATC CCC TCC TAT CTG ATA GCT TTG GCC ATC GGA GAT CTG GTT TCG GCT GAA GTT GGA 732 793 

PRSRVWAEPCLIDAANEEYN 264 
CCC AGG AGC CGG GTG TGG GCT GAG CCC TGC CTG ATT GAT GCT GCC AAT GAG GAG TAC AAC 792 853 

GVIEEFLATGEKLFGPYVWG 284 
GGG GTG ATA GAA GAA TTT TTG GCA ACA GGA GAG AAG CTT TTT GGA CCT TAT GTT TGG GGA 852 913 

RYDLLFMPPSFPFGGMENPC 304 
AGG TAT GAC TTG CTC TTC ATG CCA CCG TCC TTT CCA TTT GGA GGA ATG GAG AAC CCT TGT 912 973 

LTFVTPCLLAGDRSLADVII 324 
CTG ACC TTT GTC ACC CCC TGC CTG CTA GCT GGG GAC CGC TCC TTG GCA GAT GTC ATC ATC 972 1033 

HEISHSWFGNLVTNANWGEF 344 

CAT GAG ATC TCC CAC AGT TGG TTT GGG AAC CTG GTC ACC AAC GCC AAC TGG GGT GAA TTC 1032 1093 

WLNEGFTMYAQRRISTILFG 364 
TGG CTC AAT GAA GGT TTC ACC ATG TAC GCC CAG AGG AGG ATC TCC ACC ATC CTC TTT GGC 1092 1153 

AAYTCLEAATGRALLRQHMD 384 
GCT GCG TAC ACC TGC TTG GAG GCT GCA ACG GGG CGG GCT CTG CTG CGT CAA CAC ATG GAC 1152 1213 

ITGEENPLNKLRVKIEPGVD 404 

ATC ACT GGA GAG GAA AAC CCA CTC AAC AAG CTC CGC GTG AAG ATT GAA CCA GGC GTT GAC 1212 1273 

PDDTYNETPYEKGFCFVSYL 424 

CCG GAC GAC ACC TAT AAT GAG ACC CCC TAC GAG AAA GGT TTC TGC TTT GTC TCA TAC CTG 1272 1333 

AHLVGDQDQFDSFLKAYVHE 444 

GCC CAC TTG GTG GGT GAT CAG GAT CAG TTT GAC AGT TTT CTC AAG GCC TAT GTG CAT GAA 1332 1393 

FKFRSILADDFLDFYLEYFP 464 

TTC AAA TTC CGA AGC ATC TTA GCC GAT GAC TTT CTG GAC TTC TAC TTG GAA TAT TTC CCT 1392 1453 

FIG. 1A. 
ELKKKRVDIIPGFEFDRWLN 484 

GAG CTT AAG AAA AAG AGA GTG GAT ATC ATT CCA GGT TTT GAG TTT GAT CGA TGG CTG AAT 1452 1513 

TPGWPPYLPDLSPGDSLMKP 504 

ACC CCC GGC TGG CCC CCG TAC CTC CCT GAT CTC TCC CCT GGG GAC TCA CTC ATG AAG CCT 1512 1573 

AEELAQLWAAEELDMKAIEA 524 

GCT GAA GAG CTA GCC CAA CTG TGG GCA GCC GAG GAG CTG GAC ATG AAG GCC ATT GAA GCC 1572 1633 

VAISPWKTYQLVYFLDKILQ 544 

GTG GCC ATC TCT CCC TGG AAG ACC TAC CAG CTG GTC TAC TTC CTG GAT AAG ATC CTC CAG 1632 1693 

KSPLPPGNVKKLGDTYPSIS 564 

AAA TCC CCT CTC CCT CCT GGG AAT GTG AAA AAA CTT GGA GAC ACA TAC CCA AGT ATC TCA 1692 1753 

NARNAELRLRWGQIVLKNDH 584 

AAT GCC CGG AAT GCA GAG CTC CGG CTG CGA TGG GGC CAA ATC GTC CTT AAG AAC GAC CAC 1752 1813 

QEDFWKVKEFLHNQGKQKYT 604 

CAG GAA GAT TTC TGG AAA GTG AAG GAG TTC CTG CAT AAC CAG GGG AAG CAG AAG TAT ACA 1812 1873 

LPLYHAMMGGSEVAQTLAKE 624 

CTT CCG CTG TAC CAC GCA ATG ATG GGT GGC AGT GAG GTG GCC CAG ACC CTC GCC AAG GAG 1872 1933 

TFASTASQLHSNVVNYVQQI 644 

ACT TTT GCA TCC ACC GCC TCC CAG CTC CAC AGC AAT GTT GTC AAC TAT GTC CAG CAG ATC 1932 1993 

V A P K G S * 651 

GTG GCA CCC AAG GGC AGT TAG 1953 2014 

AGGCTCGTGTGCATGGCCCCTGCCTCTTCAGGCTCTCCAGGCTTTCAGAATAATTGTTTGTTCCCAAATTCCTGTTCCC 
TGATCAACTTCCTGGAGTTTATATCCCCTCAGGATAATCTATTCTCTAGCTTAGGTATCTGTGACTCTTGGGCCTCTGC 
TCTGGTGGGAACTTACTTCTCTATAGCCCACTGAGCCCCGAGACAGAGAACCTGCCCACAGCTCTCCCCGCTACAGGCT 
GCAGGCACTGCAGGGCAGCGGGTATTCTCCTCCCCACCTAAGTCTCTGGGAAGAAGTGGAGAGGACTGATGCTCTTCTT 
TTTTCTCTTTCTGTCCTTTTTCTTGCTGATTTTATGCAAAGGGCTGGCATTCTGATTGTTCTTTTTTCAGGTTTAATCC 
TTATTTTAATAAAGTTTTCAAGCAAAAATTAAAAAAAAAAAAAAAAAAAA 



FIG. 1B. 
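
The codon-by-codon translation shown in FIGS. 1A and 1B can be spot-checked with a short script. The following is a minimal sketch in Python, assuming the standard genetic code; the names CODON_TABLE and translate are illustrative and do not appear in the original output.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(row):
    """Translate a row of codons (whitespace is ignored) into one-letter
    amino acid codes. Unrecognized or incomplete codons become 'X'."""
    seq = "".join(row.split()).upper()
    codons = (seq[i:i + 3] for i in range(0, len(seq), 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First and last coding rows of the figure.
print(translate("ATG GCG AGC GGC"))              # MASG
print(translate("GTG GCA CCC AAG GGC AGT TAG"))  # VAPKGS*
```

Rows whose translation contains 'X' point to codons that are not valid triplets of A, C, G and T, which is a quick way to locate character-recognition damage in a scanned listing.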
Prosite Pattern Matches for 2786 

Prosite version: Release 12.2 of February 1995 

>PS00004/PDOC00004/CAMP_PHOSPHO_SITE cAMP- and cGMP-dependent protein kinase phosphorylation site. 
Query: 356 RRIS 359 

>PS00005/PDOC00005/PKC_PHOSPHO_SITE Protein kinase C phosphorylation site. 
Query: 141 TYR 143 
Query: 374 TGR 376 

>PS00006/PDOC00006/CK2_PHOSPHO_SITE Casein kinase II phosphorylation site. 
Query: 208 STWE 211 
Query: 318 SLAD 321 
Query: 368 TCLE 371 
Query: 386 TGEE 389 
Query: 408 TYNE 411 
Query: 412 TPYE 415 
Query: 496 SPGD 499 

>PS00008/PDOC00008/MYRISTYL N-myristoylation site. 
Query: 9 GSGAAR 14 
Query: 58 GLSGTA 63 
Query: 119 GQALCV 124 
Query: 333 GNLVTN 338 
Query: 364 GAAYTC 369 
Query: 614 GSEVAQ 619 

>PS00009/PDOC00009/AMIDATION Amidation site. 
Query: 159 AGKK 162 

>PS00030/PDOC00030/RNP_1 Eukaryotic putative RNA-binding region RNP-1 signature. 
Query: 416 KGFCFVSY 423 

>PS00142/PDOC00129/ZINC_PROTEASE Neutral zinc metallopeptidases, zinc-binding region signature. 
Query: 322 VIIHEISHSW 331 

FIG. 4. 
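
The matches listed in FIG. 4 can be reproduced approximately by converting a PROSITE pattern to a regular expression and scanning the translated protein. The following is a minimal sketch in Python. The two pattern strings are the commonly cited definitions for PS00004 and PS00006 and are assumptions here, since the figure reports only the matches and not the patterns. The helper names prosite_to_regex and scan are hypothetical, and the converter handles only the simple pattern elements used below.

```python
import re


def prosite_to_regex(pattern):
    """Convert a simple PROSITE pattern such as '[ST]-x(2)-[DE]' to a regex.
    Supports x, [..], {..} and (n) or (n,m) repeat counts only."""
    pieces = []
    for element in pattern.strip(".").split("-"):
        match = re.fullmatch(r"(.+?)(?:\((\d+)(?:,(\d+))?\))?", element)
        core, low, high = match.groups()
        if core == "x":                      # any residue
            core = "."
        elif core.startswith("{"):           # excluded residues
            core = "[^" + core[1:-1] + "]"
        if low and high:
            core += "{%s,%s}" % (low, high)
        elif low:
            core += "{%s}" % low
        pieces.append(core)
    return "".join(pieces)


def scan(sequence, pattern, offset=1):
    """Return (start, matched_text, end) tuples, numbered from `offset`.
    A lookahead is used so that overlapping matches are also reported."""
    regex = re.compile("(?=(" + prosite_to_regex(pattern) + "))")
    return [(m.start() + offset, m.group(1),
             m.start() + offset + len(m.group(1)) - 1)
            for m in regex.finditer(sequence)]


# Residues 345-364 and 305-324 as printed in the translation rows.
print(scan("WLNEGFTMYAQRRISTILFG", "[RK](2)-x-[ST]", offset=345))
# [(356, 'RRIS', 359)]
print(scan("LTFVTPCLLAGDRSLADVII", "[ST]-x(2)-[DE]", offset=305))
# [(318, 'SLAD', 321)]
```

Passing the row's starting residue number as offset keeps the reported coordinates in the same numbering as the Query lines of the figure.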